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Abstract — In practical asynchronous bi-directional relaying, 
symbols transmitted by two source nodes cannot arrive at the re- 
lay with perfect symbol alignment and the symbol-asynchronous 
multiple-access channel (MAC) should be seriously considered. 
Recently, Lu et al. proposed a Tanner-graph representation of 
symbol-asynchronous MAC with rectangular-pulse shaping and 
further developed the message-passing algorithm for optimal 
decoding of the asynchronous physical-layer network coding. In 
this paper, we present a general channel model for the asyn- 
chronous multiple-access channel with arbitrary pulse-shaping. 
Then, the Bahl, Cocke, Jelinek, and Raviv (BCJR) algorithm is 
developed for optimal decoding of asynchronous MAC channel. 
This formulation can be well employed to develop various low- 
complexity algorithms, such as Log-MAP algorithm, Max-Log- 
MAP algorithm, which are favorable in practice. 

Index Terms — asynchronous bi-directional relaying, network 
coding, synchronization, BCJR algorithm. 

I. Introduction 

NETWORK coding has shown its power for disseminating 
information over networks [1], [2]. For wireless cooper- 
ative networks, there are increased interests in employing the 
idea of network coding for improving the throughput of the 
network. Indeed, the gain is very impressive for the special 
bi-directional relaying scenarios with two-way or multi-way 
traffic as addressed in |]3]- 

For bi-directional relaying, it was soon recognized that the 
superimposed signal received at the relay can be viewed as the 
physically-combined network coding form of the two source 
messages, which is further impaired by the channel noise. 
Hence, the so-called physical-layer network coding (PNC) |4 | 
can be well employed to improve the throughput of the bi- 
directional relaying. 

For bi-directional relaying with PNC, it is assumed that 
communication takes place in two phases - a multiple access 
(MAC) phase and a broadcast phase. In the first phase, the 
two source nodes send signals simultaneously to the relay; 
in the second phase, the relay processes the superimposed of 
the simultaneous packets and maps them to a network-coded 
packet for broadcast back to the source nodes. Compared with 

This work was supported in part by the National Science Foundation of 
China under Grants 60972060, 61032004. The work of X. Wu was also 
supported by the National Key S&T Project under Grant 2010ZX03003-003- 
01 and by the Open Research Fund of National Mobile Communications 
Research Laboratory, Southeast University (No. 2010D03). 

Xiaofu Wu is with the Nanjing Institute of Communications Engineering, 
PLA Univ. of Sci.&Tech., Nanjing 210007, China. He is also with the National 
Mobile Commun. Research Lab., Southeast Univ., Nanjing 210096 (Email: 
xfu wu @ ieee. org) . 

Chunming Zhao and Xiaohu You are with the National Mobile Com- 
mun. Research Lab., Southeast University. Nanjing 210096, China (Email: 
cmzhao@seu.edu.cn, xhyu@seu.edu.cn). 



the traditional relay system, PNC doubles the throughput of 
the two-way relay channel by reducing the time slots for the 
exchange of one packet from four to two. 

A key issue in practical PNC is how to deal with the 
asynchrony between the signals transmitted by the two source 
nodes. That is, symbols transmitted by the two source nodes 
could arrive at the receiver with symbol misalignment. 

In |5|, Lu et al. proposed a Tanner-graph representation 
of symbol-asynchronous multiple-access channel (MAC) with 
rectangular-pulse shaping and further developed the message- 
passing algorithm for optimal decoding of asynchronous 
physical-layer network coding. 

In this paper, we provide further insights into the optimal 
decoding for asynchronous physical-layer network coding. 
In particular, the general asynchronous MAC channel with 
arbitrary pulse-shaping is developed and its connection to the 
rectangular-pulse shaping [5\ is discussed. Then, the BCJR 
formulation of the asynchronous MAC channel is proposed, 
which can shed lights for various practical algorithms suitable 
for implementation. 

II. General Channel Model for Asynchronous 
Physical-layer Network Coding 

A. Asynchronous Multiple Access Channel Model 

During the MAC phase, the source nodes A and B transmit 
the modulated signals Xa{t) and Xb{t) to the relay. For a 
general continues-time multiple-access channel, the received 
signal at the relay can be expressed as 

y{t) = haXa{t) + hbXbit) + w{t) 

oo 

= haCa{k)ga{t ~ kT - Tg) 

fc=l 

oo 

+ Y,hbCtik)gb{t-kT-n) + w{t), (1) 
fc=i 

where the delays Tq e [0,T),ti, £ [0,T) account for the 
symbol asynchronism between source nodes A and B and 
known to the receiver, w{t) is the complex white Gaussian 

2 

noise with power spectral density equal to the channel 
coefficients ha,hb are complex channel gains keeping fixed 
during transmission, and ga{t),gb{t) are normalized pulse- 
shaping functions (^ \ga{t)\'^dt = 1) for source nodes A 
and B, respectively. Without loss of generaUty, we assume that 

0<Ta<n<T. 

By passing the observations through two matched filters 
for signals (xa{t)) and (xb{t)), respectively, one can get the 
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following discrete-time samples 



In Q, it has been proved that F(z) has the form of 



n(fc+l)T+r„ 

y{t)g:it - kT - Ta)dt, 

IkT + Ta 

r(k+l)T+Tb 

y{t)gl{t - kT ~ n)dt. (2) 

It was recognized that the discrete samples 
are sufficient statistics for maximum 
a posteriori (MAP) symbol detection as explained in |6|. By 
incorporating ^ into (flSt . it follows that 



ya{k) = ^ 



F(z) = 



faa 
fba 



fabZ 
fbb 



(12) 



Vaik) = hbpl^Cbik - 

yb{k) = haPl^Caik) 

where 

1 

Pab — 7p 

and 

1 

T 



1) + haCa{k) + hbPabCbik) - 

f hbCb{k) + (fc + 1) 



Waik), 

- Wbik), 



Consequently, passage of the received vector sequence 
{y{k) = [ya{k),yb{k)]'^} through the digital filter 
(F^(z^^)) results into an output vector sequence 
{r(fc)} that can be expressed at the top of the next page. 
Now, the discrete random process {n(fc) = [na(fc), rtb(fc)]-^} 
is zero-mean white Gaussian process with covariance of cr^I. 

Ca(fc) 



Let Cab{k) = 



Cb{k) 



and r(fc) = 



ra{k) 
rb{k) 



^^2) formulation ( fT3] l can be elegantly expressed as 

r(fc) = *(c„6(fc),Cab(fc-l)) + n(fc) 



Pba 



9:it)9b{t 



ga{t)g*b{t + T 



n)dt, 



Tb)dt. 



(4) 



(5) 



One can also rewrite (fT&i in the matrix form, as shown in 
the top of the next page. Here, the discrete random process 
{ [u;a(fc), ^^(fc)]^} is Gaussian with zero mean and covariance 
matrix: 



E 



Wa{k) 

Wb{k) 



= a^A{k - I) 



(7) 



where A(fc) = if |fc| > 1 and A(0), A(l), A(-l) 
as follows 



A(0) 



A(l)-A(-1 



1 

P*ab 

\i - 



Pab 
1 

■ 





are given 



(8) 



Pba 




(9) 



For convenience of the MAP detection, the whitened 
matched filter (WMF) is often employed for transforming the 
received signal into a discrete time sequence with minimum- 
phase channel response and white noise. This procedure often 
simplifies analysis and is a first step in the implementation of 
some estimators, including the maximum-likelihood sequence 
estimation detector (MLSE) and the MAP detector The WMF 
is determined by factoring the channel spectrum into a product 
of a minimum phase filter and its time inverse. 

Let r2(z) be the (two-sized) z transform of the sampled 
autocorrelation sequences A(fc), i.e.. 



n{z) = A{k)z-^ = A(-l)z + A(0) + A(l)z- 



(10) 



By noting the property (|9]l, it follows that Vl{z) can be factored 
as 



n{z) = Y\z-^)¥{z). 



(11) 



By invoking the spectral factorization theorem, it is reason- 
able to find a physically realizable, stable discrete-time filter 
(F^(z^^)) , which can transform a colored random process 
into a white random process. 



Then, the 



(14) 



It is clear that the function is linear. By assuming 

the ideal knowledge on 4* (•,•)' '^^^ asynchronous MAC 
channel can be modeled as the vector inter-symbol interference 
(ISI) channel. To estimate the a posteriori probability (APP) 
Pr (ca6(fc)|y^~^), the BCJR algorithm can be naturally em- 
ployed. 



B. Rectangular-pulse shaping 

Let 5 — denote the relative delay between source 

nodes A and B. For the rectangular pulse-shaping functions 
ga{t),gb{t), i.e., gait) = .gf,(i) = u{t) - u{t - T) with u{t) 
denoting the unit step function, the authors in 15] proposed to 
consider the following discrete-time samples 



ye{k)^ 



1 

'sf 



kT + Ta 



yo{k)=- 



1 



1 - 5)T 



y{t)g{t -kT- Ta)dt 

(k+l)T+Ta 



y{t)g{t-kT-Ta)dt. (15) 



kT+Tt 



It is clear that ya{k) = Sye{k) + (1 - 5)yo{k),yb{k) = (1 - 
S)yo{k)+5ye{k+l). Hence, the samples { [ye{k), yo{k)]'^} are 
also the sufficient statistics for MAP detection. By combining 
© and ([nil, it follows that 



ye{k) = haCa{k) + hbCb{k - 1) + Weik) 

yo{k)^haCa{k) + hbCb{k) + Wo(fc), 



(16) 



where We{k) and Wo{k) are independent zero-mean complex 

(7^. Hence, 



1 ^2 



Gaussian variables with variance of ja 



and 



l-<5 



one can write ( fTSI l as the following matrix form 



ye{k) 
yo{k) 






ha 
ha 



hb 1 r Ca{k - 1) 

Cb{k - 1) 

Ca(fc) ^ Weik) 

Cb{k) \ [ Wo{k) 
Hence, the equivalent ISI channel model is still valid. 





hb 



(17) 



III. BCJR Algorithm 

In this section, we formulate the BCJR algorithm |8 |, which 
is known to be optimal in implementing the MAP symbol 
detection for channels with finite memory. 
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Ca(fc-l) 
Cfc(fc- 1) 



hafaa 




Ca{k) 
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Cb{k) 
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(13) 



Let us define, at time epoch k, the state Sk as 

Sfc = (Cafc(^ - 1)) = (Ca(fc - 1), Cb{k - 1)) 

and the branch metric function as 



(18) 



lkisk,Cab{k)) 



(X exp 



|r(fc)-^(c,b(fc),c,fc(fc-l))r 
2a2 



(19) 



The BCJR algorithm is characterized by the following forward 
and backward recursions: 

ak+i{Sk+i) ^ ^ y^T(Cafc(fc),Sfc,Sfc+i) 

■afc(Sfc)7fc(Sfe,Ca;,(fc)), (20) 

where T{cab{k), s^, s^^i) is the trellis indicator function, 
which is equal to 1 if Cab{k), Sk, s^+i satisfy the trellis 
constraint and otherwise; 



, Sk, Sk+l) 

•/3/c+l(Sfc+l)7fc(sfc,Cab(fc)). (21) 

Then, the joint APPs Pr (ca;,(fc)|y^~^) can be calculated as 

PT{Cab{k)\y^-') 

= ^^T{cab{k),Sk+i)ak+i{sk+i)l3k+i{sk+i), (22) 

Sk+l 

where the indicator function T{cab{k), Sk+i) is equal to 1 if 
Sk+l is compatible with Cab{k) and otherwise. 

It should be pointed out that the value of 
^ {Cab{k), Cab{k — 1)) is independent of Ca{k — 1), hence the 
state Sfc can be further simplified as Sk = {cb{k — 1)). 

Just like in f^'l, the proposed BCJR algorithm can be imple- 
mented efficiently in Log-domain, i.e., Log-MAP algorithm. 
The further simplification to the Max-Log-MAP algorithm is 
also straightforward, with some potential performance loss. 

With the joint APPs Pr (cab(fc)|y^~^), one can calculate 
the APPs of the XOR codeword Pr (ca(/e) © Cb(fc)|y^"^) for 
physical network coding. If both source A and B assumes 
the same linear channel coding, the relay node can make use 
of Pr (ca(fc) ® Cb(fc)|y^~^) to perform channel decoding to 
obtain the pairwise XOR of the source symbols. However, 
this disjoint channel-decoding and network-coding scheme 
performs worse than the joint channel-decoding and network- 
coding scheme lHJ, lITOl . 



IV. Conclusion and Future Work 

We have presented a general channel model for the asyn- 
chronous multiple-access channel with arbitrary pulse-shaping, 
typically encountered in bi-directional relaying. By evoking 
the WMF technique, one can arrive at an equivalent vector 
ISI channel, which can be employed to develop the well- 
known BCJR algorithm for getting the optimal APPs. This 
formulation can be well employed to develop various low- 
complexity algorithms, such as Log-MAP algorithm, Max- 
Log-MAP algorithm, which are favorable in practice. 

Channel coding can be well employed to improve the system 
performance. It has been reported in [5J, 1 10] that the joint 
channel-decoding and network-coding scheme can perform 
better than the disjoint channel-decoding and network-coding 
scheme. For joint network and LDPC coding over the asyn- 
chronous bi-directional relaying, it is interesting to find more 
efficient log-domain decoding algorithms suitable for practical 
implementation. 
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